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Abstract 

The formalism of synchronous tree-adjoining grammars, a variant of 
standard tree-adjoining grammars (TAG), was intended to allow the use 
of TAGs for language transduction in addition to language specification. 
In previous work, the definition of the transduction relation defined by a 
synchronous TAG was given by appeal to an iterative rewriting process. 
The rewriting definition of derivation is problematic in that it greatly 
extends the expressivity of the formalism and makes the design of parsing 
algorithms difficult if not impossible. 

We introduce a simple, natural definition of synchronous tree-adjoining 
derivation, based on isomorphisms between standard tree-adjoining deriva- 
tions, that avoids the expressivity and implementability problems of the 
original rewriting definition. The decrease in expressivity, which would 
otherwise make the method unusable, is offset by the incorporation of 
an alternative definition of standard tree- adjoining derivation, previously 
proposed for completely separate reasons, thereby making it practical to 
entertain using the natural definition of synchronous derivation. Nonethe- 
less, some remaining problematic cases call for yet more flexibility in the 
definition; the isomorphism requirement may have to be relaxed. It re- 
mains for future research to tune the exact requirements on the allowable 
mappings. 

Keywords: Synchronous tree-adjoining grammars, weak-generative ca- 
pacity, machine translation, natural-language semantics. 
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1 Introduction 



The formalism of synchronous tree-adjoining grammars (Shieber and Schabes, 



1990), a variant of standard tree-adjoining grammars (TAG), was intended to 



allow the use of TAGs for language transduction in addition to language spec- 
ification. Synchronous TAGs specify relations between language pairs; each 
language is specified with a standard TAG, and the pairing between strings 
in the separate languages is specified by synchronizing the two TAGs through 
linking pairs of elementary trees. 

This paper concerns the formal defi nitions underlying synchro nous tree- 
adjoining grammars. In previous work ( [Shieber and Schabes, 199C ) , the def- 
inition of the transduction relation defined by a synchronous TAG was given 
by appeal to an iterative rewriting process, much like the iterative rewriting 
of sentential forms defined by a context-free grammar except that the syntac- 
tic objects generated by the rewriting process were derived trees rather than 
strings. This sort of rewriting definition of derivation is problematic for several 
reasons. First, the weak-generative expressivity of TAGs is increased through 
the synchronization in the sense that the projection of the string pairs onto a 
single component, although the strings in that component are specified with 
a TAG, may not form a tree-adjoining language (TAL). Second, the lack of a 
simple recursive characterization of the derivation — a role filled by derivation 
trees for standard TAGs — makes the design of parsing algorithms difficult if 
not impossible. 

In this paper, we describe how synchronous TAG derivation can be redefined 
so as to eliminate these problems. The redefinition relies on an independent re- 
definition of the notion of tree-adjoining derivation (Schabes and Shieber, 1994) 
that was motivated completely independently of worries about the generative 
capacity of synchronous TAGs, but which happens to solve this problem in an 
elegant manner. Furthermore, it allows for existing parsing algorithms to be 
generalized to synchronous TAG transduction in a natural way. However, be- 
cause certain derivations in the rewriting sense have no analogue under the new 
definition, some linguistic analyses may no longer be statable. We comment on 
some possible negative ramifications of this fact. 



2 The Rewriting Definition of Derivation 

The original definition of derivation for synchronous TAGs was based on the 
iterative rewriting of one derived tree pair into another. In this section, we 
provide a more precise description of this approach to derivation, along with a 
discussion of its problems. First, however, we digress to discuss some notational 
issues. 
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2.1 Notation 



We assume in this and later sections a general familiarity with tree-adjoining 



grammars and their formal foundations, as described, for instance, by Vijay- 



Shanker (1987) 



We will use the following notational conventions for synchronous TAGs 
and related notions. A synchronous TAG G will be given as a set of triples 
{{L iy Ri, ^i)} where the Li and Ri are elementary trees, both initial and aux- 
iliary, forming two component TAGs Gl — {Li} and Gr — {Ri}, and ^ is 
the linking relation between tree addresses in Li and Ri. Such a grammar is in- 
tended to define a language of pairs L(G) — {(h, r,}}; the exact manner in which 
L(G) is determined is the subject of this paper. We will use the notation xl 
and xr to notate the projection of a pair x onto its left and right components, 
respectively, and generalize this notation to the first and second components 
of a triple and pointwise on sets of pairs and triples. Thus, the notations Gl 
and Gr previously introduced for the left and right component grammars are 
consistent with this notation. 

2.2 The Rewriting Process 

The rewriting process proceeds by choosing an initial tree pair {II, Ir, ^) to 
be the current derived tree pair and repeatedly performing the following steps: 

1. Choose a link ti, ^ tR between two nodes in the current derived tree pair. 

2. Choose an auxiliary tree pair (Al,Ar, ■•') from the grammar such that 
Al can adjoin at ti, in II and Ar can adjoin at tR in Ir. 

3. Modify the current derived tree pair by adjoining the chosen trees at the 
end of the chosen link, yielding the modified derived tree pair 

(lL[A L /t L ],I R [A R /t R },^") . 

This becomes the new current derived tree pair. 

The operation /[A/t] used above takes a tree /, an auxiliary tree A, and 
an address t in I and yields the result of adjoining A at address t in /. (The 
generalization to allow for substitution as well as adjunction as a primitive 
operation — both in this notation and the definition of derivation — should be 
clear.) A formal definition for this operation is given by Vijay-Shanker ( f[987| , 



page 15) and by Shieber and Schabes (1994, appendix) 



The definition of the link relation in the derived tree pair -~n" is as follows: 
All links in ^ and are included in (after suitable readdressing) except 
that the chosen link in ^ is not itself included in . Other links that impinge 
on the nodes at the end of the chosen link are retained in the derived tree pair; 
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they link to the root or foot of the newly adjoined tree as determined by whether 
the link itself is viewed as impinging on the top or the bottom of the nodeF] 



2.3 An Example of Rewriting 

By way of example, we present a sample synchronous TAG that transduces 
between a tiny fragment of English and a corresponding "logical form" semantic 
representation. 

Figure [l] shows the sample synchronous TAG composed of a set of tree pairs, 
each with a left element that is part of an English TAG fragment and a right 
element that is part of a TAG fragment for the logical form language. Thus, 
the tree pair labeled a(John) pairs a noun phrase (NP) initial tree dominating 
the proper noun 'John' with a logical term (T) dominating the constant john. 
Similarly, the tree pair a(blink) pairs a verb tree for 'blinked' with a tree for 
a formula (F) constructed as the predication of the relation (R) given by the 
symbol blink to an unspecified argument term. 

Rather than present the elements of the grammar as triples, we notate the 
links between nodes with diacritics. Thus, the a(blink) tree pair implicitly 
incorporates the link relation between tree addresses given by 

e ^ e 
2 — e 
1 — 2 

These three links are marked with the diacritics Q~], EEL and GO respectively. 

The link, for instance, connects the NP node in the left tree at address 
1 with the T node in the right at address 2, thereby allowing the two trees of 
another tree pair to operate respectively at these two nodes. Since the two nodes 
are substitution nodes (as conventionally marked by the j), the operations on 
this link would be substitutions at both ends. For example, the initial tree pair 
a(John) can operate at this link, yielding the tree pair given in Figure |[ Note 
that the remaining links in the aiplink) tree labeled Q~] and are preserved in 
the derived tree pair. 

Continuing on in this way, the resultant derived tree pair can be further 
acted upon, say, by the base pair (3(twice), whose elements can adjoin at the 
ends of the Q~] link, yielding the derived tree pair in Figure |a. The issue of how 
to handle multiple links impinging on the same node becomes relevant here, 

1 In previous work, links were typically thought of as impinging on the top of a node unless 
otherwise stated. We will retain that convention here. Further flexibility can be obtained 
by allowing each link to specify whether it links to the top or bottom of the nodes. Thus, 
the link relation ^—-i in a triple (Lj, Ri, can be thought of as being of type dom(Li) X 
{t, 1} x dom(Ri) X {t, I}, where dom(A) is the set of tree addresses in the tree A and f and 
J. serve as markers to specify whether the link impinges on the top or bottom, respectively, of 
the specified node. All of this machinery becomes superfluous, however, in the context of the 
natural definition of derivation given in Section ta. 
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Figure 1: A synchronous TAG that describes the semantic ambiguity of the 
sentence 'John intentionally blinked twice'. 

since the right end of the remaining link GO in the derived tree pair impinges on 
a node at which adjunction has just occurred. Should the link now impinge on 
the root or the foot node of the tree adjoined at that node? We place the link 
at the root, as stipulated above, so that further rewriting of the QD link, say 
with the adverbial tree pair [3(intentionally) leads to the derived tree pair in 
Figure ||b, corresponding to the string 'John intentionally blinked twice'. In the 
associated logical form, the predication of int has scope over the proposition 
twice(blink(john)), and the sentence is taken to describe a single intentional 
act of blinking twice. Had the two links been rewritten in the other order — 
link first, yielding the pair in Figure ||c, and then link G~] yielding the pair 
in Figure — the generated logical form twice(int(blink(j))) describes two 
intentional acts each of single blinkings. 

Thus, this grammar manifests the ambiguity in the sentence 'John intention- 
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Figure 2: A tree pair derived by operation of the a(John) tree at link in 
the a(blink) tree. The pair specifies the transduction between the string 'John 
blinks' and its logical form blink(john). 

ally blinked twice'. Note that the ambiguity arises from the ability to perform 
two rewriting steps at the same node, the root F node in the logical form tree 
a(blink)n corresponding to the word 'blinked'. 

2.4 Problems with the Rewriting Definition 

There are two problems with the rewriting definition of synchronous TAGs, 
having to do with the expressivity and implementability of the formalism under 
that definition. 

2.4.1 Expressivity 

Synchronous TAGs under this definition may specify non-tree-adjoining lan- 
guages. More precisely stated, given a grammar G, although, by definition, 
L(Gl) is a tree-adjoining language, L(G)l may not be. 

A simple example of a synchronous TAG that can be projected onto a non- 
TAL is given in Figure ^. This grammar specifies the string relation that pairs 
all strings of the form a n b n c n d n e n f n g n h n with the empty string. Its projection 
onto its first component is, therefore, a non-tree-adjoining language. Figure ^| 
shows the steps in the derivation of the n = 1 case. The derived tree pair for 
the n = 2 case is given in Figure [| 

2.4.2 Implementability 

In addition to the expressivity problem, there is no natural way to use a syn- 
chronous grammar for transduction under this definition. To use a synchronous 
TAG G for transduction, a given string u>l is to be transduced to wr just in 



G 




(c) (d) 



Figure 3: Derived tree pairs from the grammar of Figure [I]. The derivation of 
the meaning int(twice(blink(j))) proceeds through the derived tree pairs in (a) 
and (b). The derivation of the meaning twice(int(blink(j))) proceeds through 
the derived tree pairs in (c) and (d). 

case (wl,wr) £ L(G). This requires, intuitively speaking, parsing of the string 
wl relative to Gi yielding a derivation Dl, reconstruction of the synchronous 
(rewriting) derivation D$ from and finally, generation of the string wr 
according to this reconstructed derivation. Schematically, the process can be 
depicted as proceeding thus: 

w L — > Dl — > D s — ► w R 
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Figure 4: A synchronous TAG for a non-tree-adjoining language 
a n b n c n d n e n f n g n h n . 




Figure 5: Steps in the derivation of abedefgh. The left derived tree pair has a 
remaining obligatory adjoining constraint, which when satisfied yields the right 
derived tree pair. 

Unfortunately, the structure of a synchronous derivation bears no uniform rela- 
tionship to the kind of derivation postulated for standard TAGs. (This point is 
discussed further in the next section.) Thus, if a standard TAG parsing algo- 
rithm is used for the first step in the process (so that Dl is a traditional TAG 
derivation tree), the second step is not well defined. It is therefore not clear how 
synchronous TAGs can be effectively used under this definition of derivation. 
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Figure 6: Derived tree pair for a 2 b 2 c 2 d 2 e 2 f 2 g 2 h 2 . 

Note that this point is independent of whether the three conceptual phases 
of processing are interleaved in time. The possibility to interleave the compu- 
tations of the phases does not make their definition any simpler. 

3 The Natural Definition of Derivation 



The notion of derivation just presented for synchronous TAGs is quite non- 
standard for the TAG literature in being "flat" and rewriting oriented. Recall 
that the standard definition of TAG derivation, due to Vijay-Shanker (1987), is 
hierarchically structured in terms of derivation trees, trees that serve to charac- 
terize the operations required to construct a particular derived tree, and hence 
its yield. 

TAG derivation trees are composed of nodes, conventionally notated as 77, 
possibly in its subscripted variants. The parent of a node 77 in a derivation 
tree will be written parent(rj) , and the tree that the node marks adjunction 
of will be notated tree{rj). The tree tree(rj) is to be adjoined into its parent 
tree(parent(r\)) at an address specified on the arc in the tree linking the two; 
this address is notated addr(rf). (Of course the root node has no parent or 



address; the parent and addr functions are partial.) 

A derivation tree is well-formed if for each arc in the derivation tree from 77 
to parent{rf) labeled with addr(r)), the tree tree(rf) is an auxiliary tree that can 
be adjoined at the node addr(rj) in tree(parent(rj)). (Alternatively, tree(rj) is 
an initial tree that can be substituted at the node addr(i]) in tree(parent(rf)).) 
Furthermore, and without loss of expressivity, it is standard to exclude multiple 
sibling arcs specifying operations at the same tree address in the same tree. 
This exclusion makes the definition of the derived tree for a given derivation 
tree determinate. 

A derivation tree specifies a derived tree by virtue of the normal definitions 
for adjunction and substitution. The language of a TAG G is then the set of 
strings that are the yields of derived trees specified by derivation trees that are 
well-formed according to G. We define the function V from derivation trees to 
the derived trees they specify, according to the following recursive definition: 



V(D) 



tree(rj) 

if D is a trivial tree of one node r\ 

tree(ri)[D(D 1 )/t 1 ,V(D 2 )/t 2 , . . .,V(D k )/t k ] 
if D is a tree with root node n 

and with k child subtrees Dx,...,D k 



Here I\A\/t\, . . . , Ak/tk] specifies the simultaneous adjunction (or substitu- 
tion) of trees A\ through A k at t \ through t k , respectively, in /. Using the 



definitions of Vijay-Shanker ( 1987 ), this is well defined only as long as the U 
are disjoint, hence the need for the aforementioned exclusion. 

A definition along these lines for synchronous TAGs would be quite natural. 
We would have derivation trees that specify at each node an elementary tree 
pair, with arcs labeled by pairs of tree addresses (such that the two addresses are 
linked in the parent elementary tree pair). A function from derivation trees to 
the derived tree pairs they specify — a generalization of the T> function defined 
above — would then be used to generate the derived trees and the language of 
a synchronous grammar. 

It should be obvious that such a synchronous derivation tree can be trivially 
restated as a pair of standard derivation trees, further simplifying the defini- 
tion of synchronous TAG derivation. This leads to the following definition of 
synchronous TAG derivation. A derivation is a pair (Dl,Dr) where 

1. Dl is a well- formed derivation tree relative to Gj,. 

2. Dji is a well- formed derivation tree relative to Gr. 

3. Dl and Dr are isomorphic. That is, there is a one-to-one onto mapping 
/ from the nodes of Dl to the nodes of Dr that preserves dominance, i.e., 
if fijli) = Vr then / (parent^)) = parent(rj r ). 



10 



4. The isomorphic operations are sanctioned by links in tree pairs. That 
is, if f{qi) = rj r , then there is a tree pair (tree(rji) , tree(rj r ) , ---') in G. 
Furthermore, if r\i has a parent, then there is a tree pair 

(tree(parent(r)i)) ,tree(parent(ri r )) , ^~-) 

in G and addr{rji) addr{ri r ). 

This, then, is the most natural definition of synchronous tree-adjoining deriva- 
tion, as it is the natural generalization of the definition of derivation for standard 
TAGs. It merely requires that there be two derivations that are separately well- 
formed and appropriately synchronized as specified by the links. 

Several aspects of this definition are noteworthy. First, the derived tree pair 
for a derivation (Dl,Dr) is (T>(Dl), D(Dr)). Second, the definition does not 
require extra linking information specifying whether the link impinges on the top 
or bottom of the linked nodes. It is completely declarative; no vestiges remain 
of the rewriting definition. Finally, it solves the two problems of expressivity 
and implementability mentioned above, as described in the next section. 



4 Advantages of the Natural Definition 

We show in this section that the natural definition of synchronous derivation 
solves the two problems described in Section 2.4. 



4.1 Expressivity 

Under the revised definition of synchronous derivation, only tree- adjoining lan- 
guages can be expressed by a synchronous TAG. To see why, we look first at 
the problematic example of Figure [|, and then turn to a general argument. 

Under the new definition, adjoining constraints are no longer inherited in 
an overall derived tree being generated incrementally in the flat rewriting pro- 
cess. Rather, they apply to the auxiliary trees that directly adjoin to the node. 
Thus, in the grammar of Figure ^, the links in the auxiliary trees can never be 
operated on. For instance, the link in (3\ requires fii to be adjoined there, but 
its corresponding left half cannot adjoin at the left end of the link. Similarly, 
the link in 02 is useless as well. Thus, the only well-formed derivation is the 
one with no adjunctions whatsoever; the language of the grammar includes the 
single string pair (e, e) generated by its initial tree pair. 

In general under the revised definition, the left-projection language, say, 
of a synchronous TAG is specifiable by a pure TAG by simply mapping any 
adjoining constraints on the right trees to corresponding ones on the linked 
nodes on the left and projecting the grammar on its left component. (The 
example of Figure ||, so projected, is the normal TAG given in Figure which 
specifies the language containing only the empty string as expected.) 
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Alternatively, the TAL nature of synchronous TAGs under this definition can 
be easily shown by reduction to tree-set-local multicomponent TAGs (MCTAG), 
which are known to generate only tree-adjoining languages.^ Each elementary 
tree pair in the synchronous TAG corresponds to an elementary tree set in the 
MCTAG. To ensure that left-hand trees are not adjoined into right-hand trees 
and vice versa, the node labels on the left- and right-hand trees are uniformly re- 
named apart. Each node in a left-hand tree is marked with a selective adjoining 
constraint that allows adjunction only of certain elementary tree sets. For each 
link that impinges on the node, and each tree pair that can operate on that link, 
the corresponding tree set is allowed by the SA constraint. Similar constraints 
are added to each right-hand node. Finally, for each pair of nonterminals that 
root the trees in an initial tree pair, a new elementary tree is constructed rooted 
in a new nonterminal symbol not used elsewhere with two nonterminal children 
labeled by the left and right root nonterminals of the initial tree pair and which 
are to be filled by substitution. 

Since any synchronous TAG can be reduced to a tree-set-local MCTAG, 
the languages generated by synchronous TAGs are at most the tree-adjoining 
languages. The converse inclusion is obvious. 



2 The observation that synchronous TAGs under the new definition should be reducible to 
MCTAG was brought to our attention by Owen Rainbow. 
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4.2 Implementability 



Another advantage of the new definition of synchronous derivation is in its 
utility for implementation of synchronous TAG transducers. Recall that under 
the rewriting definition, the structure of a synchronous derivation bears no 
uniform relationship to the kind of derivation postulated for standard TAGs 
and therefore recovered by standard TAG parsing algorithms. Thus, the second 
step in the schematic process 

wl — > D L — > Ds — ► WR 

is not well defined. Under the natural definition, however, the synchronous 
derivation Ds is just (Dl,Dr). This close relation between a synchronous TAG 
derivation and derivations for the left and right projected grammars makes syn- 
chronous transduction straightforward. Any method for parsing that generates 
a standard derivation tree for a grammar can be applied to parse a string wl 
relative to the left projection grammar. The resultant derivation is isomorphic 
to the derivation tree for the right projection grammar, where the mapping is 
given directly by the synchronous grammar. The right projection derivation is 
thus easily constructed, and the corresponding derivation tree can be computed 
directly. Schematically, the process looks like this: 

wl-^Dl^ D s (= (D l ,Dr)) -^Dr^wr 

This methodology applies even under the view of synchronous TAG deriva- 



tions to be described in Section 5T. For instance, Schabes and Shieber (1994 ) 
describe a parsing method for standard TAGs that can be used to construct 
derivation trees on the fly while parsing. A simple modification of the method 
can construct the isomorphic derivation tree for the object grammar of a trans- 
duction. In fact, this redefinition has allowed for the first implementation of 
synchronous TAG processing, due to Onnig Dombalagian. This implementation 
was based on the inference-based TAG parser that we have presented elsewhere 
( |5chabes and Shieber, 1994| ). 



5 Problems with the Natural Definition 

Along with the advantages of the new definition of synchronous TAG derivation, 
new problems are introduced as well. First, the exclusion of multiple adjunctions 
at a single address is problematic for synchronous TAG derivations. Second, 
the isomorphism requirement between the derivation trees may be too strong 
as well. The former problem admits of a straightforward solution, which we 
describe below. The latter does not; we describe the symptoms of the problem 
but leave its resolution as an open issue for further research. 
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/^(intentionally) /J(twice) /^(intentionally) 

L R R 

Figure 8: Derivation tree pair for the grammar of Figure |l|. 

5.1 Multiple Adjunction 

Consider the synchronous TAG analysis of the semantics of adverbs given in 
Figure 0. This grammar is intended to allow for the ambiguity of strings such 
as 'John intentionally blinked twice' as shown in Figure ||. As previously men- 
tioned, the ambiguity arises from the ability to perform two rewriting steps at 
the same node, the root F node in the elementary tree a(blink)ft corresponding 
to the word 'blinked'. Under the natural definition, however, this would entail a 
derivation tree pair of the geometry given in Figure |^. But the right derivation 
tree is ill-formed, as it violates the prohibition against multiple adjunctions at 
a single address. 

It was the desire to model semantic ambiguity through violations of the 
prohibition that led us originally to a rewriting — as opposed to a derivation 
tree — approach to defining synchronous TAG derivation. Thus, the deviation 
from the natural definition of synchronous derivation was necessary because we 
required the ability of two elementary trees to be adjoined at the same node. 
Unfortunately, the rewriting interpretation of TAGs is a very inelegant way 
in which to get this ability, leading as it does to the problems described in 



Section 2.4. Nonetheless, without this ability, the utility of synchronous TAGs 
is severely diminished. 

For quite separate reasons, Schabes and I have been examining alternatives 
to Vijay-Shanker's definition of TAG derivation so as to allow for multiple ad- 



junctions of certain auxiliary trees at the same node. Our solution (Schabes and 
|3hicber, 1994 ) divides the class of auxiliary trees into two types, modifier trees 



and predicative trees, of which only the former allow such multiple adjunctions. 
In Vijay-Shanker's definition of derivation, a derivation tree is well-formed if 
no two auxiliary trees are adjoined at the same node in the same tree. In our 
revised definition, a derivation tree is well-formed if no two predicative auxiliary 
trees are adjoined at the same node in the same tree. Furthermore, so as to 
determinately specify a derived tree, all modifier trees that are adjoined at the 
same node in the same tree are ordered with respect to one another. Figure ^| 
shows the interpretation, in terms of derived tree (^|b) , of a derivation tree (|]a) 
with multiple adjunctions at a single node. In essence, this diagram gives the 
interpretation of the operation I[Ax/ti, . . . ,Ak/tk] when the t& are not disjoint. 
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Figure 9: Interpretation of derivations with multiple adjunctions at a single 
node. In this case, several modifier trees M\ through have been adjoined 
at node t in tree T, along with a single predicative node P. The derived trees 
associated with P and Mi through Mk, namely Ap and A\ through Ak appear 
in the derived tree in that order. 

The existence of the revised definition of derivation vitiates the argument for 
the flat definition of synchronous TAG derivation. Rather, a direct definition 
is now possible along the previous lines. The only difference is that Dl and 
Dr are taken to be well-formed derivation trees of the new variety. Taking the 
trees (3(twice) r and j3 (intentionally) r to be modifier trees, the synchronous 
derivation in Figure |^ is well-formed. The two possible orderings of the child 
nodes adjoining at address e provide for the two readings of the ambiguous 
sentence. 

5.2 The Isomorphism Requirement 

A potentially more severe (and certainly more subtle) problem results from the 
requirement of isomorphism between Dl and Dr. There seem to be certain ap- 
plications of synchronous TAGs for which this requirement is too strong. In this 
section, we present a taxonomy of potential counterexamples to isomorphism, 
organized by the "shape" of the nonisomorphic part of the mapping between the 
derivation trees. The examples are drawn from both technological application 
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of synchronous TAG to the problem of defining translations between languages 
and application of synchronous TAG to the modeling of natural language se- 
mantics. It may turn out that different applications provide different amounts 
of pressure to loosen the isomorphism requirement in differing ways. Although 
we discuss several possible approaches to resolving this issue, we leave to further 
work whether a satisfactory solution for a given application can be found, and 
if so, what that solution might be. 



Many-to-One Mappings 




The simplest examples are cases in which an atomic construction in one 



language is compound in another. For example, Abeille, Schabes, and Joshi 
(1990D point out that the English adverbial 'hopefully' is translated by the 



French phrase 'on espere que'. Whereas the English corresponds to a single 
elementary tree, the French corresponds to a tree derived by substituting the 
elementary tree for 'on' as the NP argument of 'espere'. Such examples argue 
for the ability to allow the mapping between the left and right derivation trees 
to be relaxed from a strict isomorphism. 

One might think (as indeed the present author did before penetrating dis- 
cussions with Anthony Kroch) that a mismatch such as this shows that the 
isomorphism requirement must also be too strong for the purpose of modeling 
natural language semantics, for if these two constructions — 'hopefully' and 'on 
espere que' — have the same semantics, then at least one of the two (if not 
both) must exhibit a mismatch between the natural language derivation and 
a derivation of its logical form. The error in this reasoning follows from the 
assumption that the relationship of "corresponds as an appropriate translation" 
(in the sense in which bilingual dictionaries record such facts) is tantamount to 
"means the same as" . This assumption is highly suspect. Bilingual dictionaries 
do not codify perfect translations in any sense, if such a notion is even coherent. 

However, mismatches of this variety may also be found in applications to di- 
rectly modeling natural-language semantics. For instance, the transduction re- 
lationship between a compound idiom (such as 'kick the bucket') and its atomic 
semantics (given, e.g., by a simple predication of die) might be thought to be 
of this form. 
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Elimination of Dominance 




Even when the number of nodes in the paired derivation trees is the same, 
they may exhibit different structure. Nodes participating in a domination rela- 
tionship in one tree may be mapped to nodes neither of which dominates the 
other. 

Abeille (personal communication) has noted a potential example of such a 
mismatch. For instance, in the sentence 

(1) Le docteur soigne les dents de Jean. 
The doctor treats Jean's teeth. 

the subphrase 'de Jean' is substituted into the 'dents' tree syntactically, and 
arguably modifies the semantics of that tree as well. However, the cliticizcd 
version of the sentence 

(2) Le docteur lui soigne les dents. 
The doctor treats his teeth. 

involves syntactic adjunction of the clitic 'lui' in the tree for 'soigne', although 
the translation into English, as before, places the pronoun within the object NP 
of the verb. Schematically, the derivation trees should show the geometry given 
in Figure [l0| Note that the separate derivations are not isomorphic; a sibling 
relation in one tree corresponds to a domination relation in the other. 

Again, examples may be found in the arena of semantic interpretation. Al- 
though the argumentation is much more complex, and well beyond the scope 
of this paper, similar relationships arise in the context of modeling quantifier 
scope ambiguity. 



Inversion of Dominance 




An even more extreme relationship, in which domination relationships are 
not only introduced but actually inverted, is exemplified by the French sentence 
and its English translation given in (|), and discussed by |Whitelock (1992fl . 
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a(lui) 




a( soigne) 




a(dents)^- a(teeth) 
1 

^ a(his) 



a( treats) 



2.1 




Figure 10: Schematic derivation tree pair for example (Q). The arrows show the 
required mapping between the derivations, which is not an isomorphism. 



Figure 11: Schematic derivation tree pair for example (|3|). The arrows show the 
required mapping between the derivations, which is not an isomorphism. 

(3) Jean monte la rue en courant. 
John runs up the street. 

In this example, the phrase 'en courant' adjoins as an adverbial modifier to 
the verb 'monte'. Presumably, 'en courant' would be paired with the English 
'runs' and 'monte' with the English 'up'. But the derivation tree for the English 
sentence would not then have the isomorphic structure in which 'runs' adjoins 
or substitutes into 'up', at least under the most natural analysis. Rather, the 
converse should hold; 'up' should be inserted into 'runs'. Figure [ll] shows the 
derivation tree pair schematically, including the nonisomorphism mapping be- 
tween the trees. 

We know of no example of inversion of dominance in applications to natural- 
language semantics. 

5.3 Relaxing Isomorphism 

In many of the above examples, although the mapping among derivation nodes 
is not an isomorphism, the deviation from isomorphism is nicely bounded, so 
that they could be well handled by allowing bounded subderivations to be con- 
sidered elementary for the purpose of defining the relationship between the 
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trees. In using synchronous TAGs as a model for language translation, that is, 
essentially to specify a bilingual lexicon, it is not surprising that bounded sub- 
derivations in one language are paired as a whole with bounded subderivations 
in another. Indeed, this is the modus operandi for traditional bilingual dictio- 
naries. The Harper/Collins/Robert English- French dictionary provides an entry 
for 'to [run] down/in/off' with translation 'descendre/entrer/partir en courant' 
essentially providing the mapping between the pertinent subderivations. Sim- 
ilarly, the pertinent entry under 'hopefully' specifies the translation of '[hope- 
fully] it won't rain' as 'on espere qu'il ne va pas pleuvoir', providing implicitly 
the subderivation mapping of 'hopefully' in its presentential position with 'on 
espere que'. For the most part, placing the isomorphism at the level of certain 
primitive and bounded subderivations is plausible, sufficiently expressive,^ and 
retains the advantages described in Section ||. 

If further relaxation of the isomorphism requirement is to be allowed, some 
method of controlling the relationship between the pair derivations will be 
needed. Owen Rainbow and Giorgio Satta (personal communication) have con- 
jectured that an approach along the lines of control grammars might be useful. 
This possibility, though tantalizing, remains to be explored. 

Whitelock's method of "shake-and-bake" translation (Whitelock, 1992), un- 
der which translation involves reusing the same components but under different 
relationships, seems to correspond to a version of synchronous TAGs in which 
there is no constraint on the geometries of the derivation trees, the only re- 
quirement being that they are constructed from paired elements. This extreme 
version of relaxing the isomorphism requirement may in the end be necessary. 

The exact nature of the relationship between paired derivation trees must 
remain for future work. 



6 Conclusion 

We have introduced a simple, natural definition of synchronous tree-adjoining 
derivation, based on isomorphisms between standard tree-adjoining derivations, 
that avoids the expressivity and implementability problems of the original rewrit- 
ing definition. The decrease in expressivity, which would otherwise make the 
method unusable, is offset by the incorporation of an alternative definition of 
standard tree-adjoining derivation, previously proposed for completely separate 
reasons, that allows for multiple adjunctions at a single node in an elemen- 
tary tree. The increased flexibility from the ability to perform such multiple 
adjunctions makes it conceivable to entertain using the natural definition of 
synchronous derivation. Nonetheless, some remaining problematic cases call for 
yet more flexibility in the definition; the isomorphism requirement may have to 

3 The French clitic example, however, remains problematic. The relation between the clitic 
and the NP which it is semantically related to seems to be potentially unbounded. 
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be relaxed. It remains for future research to tune the exact requirements on the 
allowable mappings. 
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